Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🔲 Loop Tiling
Cache Optimization, Blocking, Matrix Multiplication, Locality
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82987
posts in
443.6
ms
KV-CoRE: Benchmarking Data-Dependent Low-Rank
Compressibility
of
KV-Caches
in LLMs
arxiv.org
·
1d
🎯
Tensor Cores
Intersection
of Two Linked
Lists
dev.to
·
16h
·
Discuss:
DEV
🔀
Operator Fusion
CUDA
Guide:
Workflow
for Performance Tuning
digitalocean.com
·
2d
⚡
CUDA Programming Patterns
Processes and Threads -
Discourse
on
Concurrency
, Part I
ayanmali.substack.com
·
21h
·
Discuss:
Substack
⚡
CUDA Programming Patterns
The Risk of Not
Optimizing
Clock
Power
semiwiki.com
·
19h
🎛️
CUDA Optimization
I Built a 6
BIPS
JIT
in Five Months
unlikelyemphasis.substack.com
·
1d
·
Discuss:
Substack
🚀
Compiler Optimization
SPPAM
: Signature Pattern Prediction and Access-Map
Prefetcher
arxiv.org
·
2d
🎛️
CUDA Optimization
An Analysis of User-space
Idle
State Instructions on
x86
Processors
danglingpointers.substack.com
·
1d
·
Discuss:
Substack
🧠
CPU Architecture
Continual
learning and the post
monolith
AI era
baseten.co
·
11h
·
Discuss:
Hacker News
📊
Gradient Accumulation
How Virtual
Textures
Really Work
shlom.dev
·
19h
·
Discuss:
Hacker News
📈
GPU Occupancy
— ### Abstract In the framework of the
AdS/CFT
correspondence, the entanglement wedge reconstruction problem—mapping quantum information on the
conform
...
freederia.com
·
16h
✂️
CUTLASS
Fast
Autoscheduling
for Sparse ML
Frameworks
ajroot.pl
·
2d
·
Discuss:
Hacker News
🎯
Tensor Cores
Stratum
: Architecting a
Configurable
Cache Simulator with C++ and Racket
thecloudlet.github.io
·
5d
·
Discuss:
Hacker News
🧠
CPU Architecture
Concurrency Flavours --
Lucian
Radu
Teodorescu
: Standard C++
isocpp.org
·
2d
⚡
CUDA Programming Patterns
Examining
Turbopuffer
ANN v3
terencezl.github.io
·
1d
·
Discuss:
Hacker News
📊
Profiling Tools
Binary
Search Trees: Why They’re Great in Memory but Terrible on
Disk
dev.to
·
1d
·
Discuss:
DEV
📈
Occupancy Optimization
[$]
Modernizing
swapping
: the end of the swap map
lwn.net
·
1d
⚡
CUDA Programming Patterns
Flow-Based Programming:
Seminal
Texts and
Theoretical
Foundations
repolex.ai
·
1d
·
Discuss:
Hacker News
💡
LSP
Taming the Regex Monster: Optimizing Massive
Literal
Alternations
modern-c.blogspot.com
·
1d
·
Discuss:
r/golang
🔍
Type Checkers
Hello Edge: Keyword
Spotting
on
Microcontrollers
paperium.net
·
12h
·
Discuss:
DEV
🎯
Tensor Cores
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help